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I. THe MuLtieLe ABSOLUTE 
PREDICTION PROBLEM 


N A previous report entitled ‘‘A Tech- 
I nique for the Development of a Differ- 
ential Prediction Battery” (1) we have 
presented a technique for selecting from a 
battery of predictors a subset of specified 
size which will have the highest differen- 
tial prediction efficiency for a given set of 
criterion variables. In subsequent refer- 
ences to this report it will be designated 
merely as ‘Differential Prediction.”’ Dif- 
ferential prediction efficiency was defined 
in terms of the accuracy with which 
differences between all possible pairs of 
criterion measures could be predicted. 

One may also be concerned with a tech- 
nique for selecting from a battery of po- 
tential predictors that subset of specified 
size which will have the highest prediction 
efficiency for all the criterion variables 
irrespective of how well it differentiates 
among them. To distinguish between the 
two types of multiple prediction we shall 
call the former multiple differential pre- 
diction and the latter multiple absolute 
prediction. We may define the efficiency 


' This research was carried out under Contract 
Nonr-477(08) between the University of Washing- 
ton and the Office of Naval Research. The data used 
to illustrate the technique were provided by Pro- 
fessor August Dvorak. Computations were carried 
out by Robert Dear and William Meredith. Char- 
lotte MacEwan assumed major editorial responsibil- 
ity for the preparation of the manuscript. Much 
credit is due the typist, Eleanor Green. Supervision 
of both computational and editorial activities was 
provided by William Clemans. To each of these able 
contributors I am deeply grateful. 


of multiple absolute prediction in terms 
of the accuracy with which all of the cri- 
teria are predicted, irrespective of how 
well the selected battery differentiates 
among the criteria. 

In Differential Prediction we showed 
that an appropriate measure of the pre- 
diction efficiency of a selected battery is 
the difference between the average vari- 
ance of the predicted criteria and their 
average covariance. In the case of multi- 
ple absolute prediction we may take as a 
measure of the prediction efficiency of the 
selected battery the sum of the variances 
of the predicted criteria irrespective of 
their average covariances. In the case of 
standard criterion measures such a meas- 
ure can readily be shown to be the sum 
of the squares of the multiple correlations 
of the criteria with the predictors, 

In both types of multiple prediction we 
consider only the case of standard cri- 
terion measures. However, it is a simple 
matter to weight the criterion variables 


differentially if such weights are available, 
so that the variances of the criterion vari- 
ables would be proportional to the squares 


of the weights rather than all equal to 
unity. 

As it turns out, in both differential and 
absolute prediction the end result is a set 
of predictions of success in each of the cri- 
terion activities. Furthermore, the weights 
to be used in both cases are the conven- 
tional] least-square regression weights, The 
essential difference between the two cases 
is that a different set of predictors is sel- 
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ected for the two types of prediction. 

There may, however, be a great dea! of 
overlap between the two sets of predic- 
tors, depending on the original group of 
potential predictors from which the subset 
is taken and the correlations among the 
criterion variables. Nevertheless, it was 
noted in the case of differential prediction 
that correlations among the criterion vari- 
ables are not required by the selection 
procedure. Satisfactory estimates of the 
correlations of each criterion with the 
predictors are required. However, this is 
also true for the case of multiple absolute 
prediction. 

In the selection procedure for the differ- 
ential prediction battery one predictor at 
a time was selected. The first predictor 
selected was the one which according to 
the definition gave the highest index of 
differential prediction efficiency. The next 
predictor was the one which in combina- 
tion with the first gave the highest index 
of differential prediction efficiency. Thus 
predictors are selected one at a time until 
the prespecified number has been selected. 
The assumption here is that this pro- 
cedure of selecting a set of specified size 
yields an index of differential prediction 
efficiency sufficiently close for practical 
purposes to that of the particular set of 
specified size which, for the particular 
data, would yield the highest index of 
differential prediction efficiency. 

Similarly, one test at a time is selected 
for the absolute prediction battery and 
the analogous assumption is made with 
reference to the index of absolute predic- 
tion efficiency of the selected battery. 


Il. THe CompuTaATIONAL Metuop 
A. Selecting the Differential Predictors 


Assume that we have given the matrix 
of intercorrelations of m potential predic- 
tors, and the matrix of correlations of the 
n predictors with N criterion variables. 


These are the same two sets of data used 
in Differentia! Prediction. As in that re- 
port we let 

ig=the matrix of intercorrela- 
tions of the n predictors, 

\T =the (nN) matrix of correlations 
of the m predictors with the N cri- 
teria. 

Then, of course, each column of ,T repre- 
sents a criterion variable and each row a 
predictor variable. The following steps in 
the predictor selection procedure corre- 
spond to those illustrated by the numeri- 
cal example in Part III, and reference to 
the example should clarify these steps. 
For the example, the same data are used 
in this report as in Differential Prediction. 
(It will be noted that in the example, the 
:T’ matrix is used rather than ,7. In fact, 
the transpose of ;7 is used throughout in 
the numerical computations as a more 
convenient form for accommodating all 
the matrix elements and the relevant com- 
putations on one worksheet. This form is 
especially convenient if the number of 
criterion variables is large. Consequently 
operations defined in the current sections 
for rows of the T matrices have been per- 
formed, in the example, upon columns.) 

1. We start by computing the sums of 
squares of each row of the ;7 matrix and 
call these values for the kth row ,V,. The 
equation for , V, is simply 


In matrix notation Equation 1 is given by 
[2] 


where ,7,.’ is the kth row vector of ,T. 

We take the predictor variable corre- 
sponding to the largest ,V,% as the first 
predictor in the subset to be selected and 
designate this as predictor a. 

2. Next we compute a new matrix 2 
from the ig matrix and its ath row or 
column by the formula 


Bak [3] 


This step is precisely the same as for 
Differential Prediction except that the 
ath variable will not in general be the 
same for both types of selection. Equation 
3 means that the element in the kth row 
and pth column of 2g is obtained by sub- 
tracting from the corresponding element 
in yg the product of the elements in the 
kth row and ath column and the pth row 
and ath column of ig. As in the case of 
Differential Prediction the ath row and 
column of og will have all zero elements. 
In matrix notation Equation 3 is 


28 = 1818.0 18.05 [4] 
where ig. is the ath column vector of ig 
and is its transpose. 

3. Compute a 27 matrix from the ,7 
matrix and the ath row of the ,g matrix by 
means of the scalar formula, 


kp = iT iT ak ls] 


This step is the same as step 3 in Differ- 
ential Prediction except that the ath vari- 
able will not in general be the same. Equa- 
tion 5 means that the element in the kth 
row and pth column of 7 is obtained by 
subtracting from the corresponding ele- 
ment of ,7J the product of the element in 
the ath row and kth column of ,7 by the 
element in the ath row and pth column of 
ig. As in Differential Prediction the ath 
row of 27 will have all zero elements. In 
matrix notation Equation 5 is 


oT = \T— a.’, [6] 


where ,7,,.’ is the ath row vector of ;T and 
ig. is the ath column vector of ig. Since 
ig is symmetrical we could, of course, take 
either its ath column vector or the trans- 
pose of its ath row vector. 

4. Compute the sum of squares for each 
row of »J obtained in step 3 by means of 
the scalar formula 


[7] 
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In matrix notation Equation 7 is 
[8] 


where the notation is analogous to that 
defined for Equation 2. It should be noted 
that step 4 is somewhat simpler than the 
corresponding step for Differential Pre- 
diction. For the latter case we compute NV 
times the variance of each row instead of 
simply the sum of squares. 

5. Next divide each of the 2V, values 
computed in step 4 by the corresponding 
diagonal element of »g obtained in step 2 
and designate the kth ratio as »A,. In 
scalar notation this is 

lo| 
2B kk 
In matrix notation Equation 9 may be 
written 


where 

D,4 is a diagonal matrix of the 2A,, 

Dy isa diagonal matrix of the diagonal 

elements of og, 

DY is a diagonal matrix of the »V,. 

Step 5 is analogous to the same step for 
Differential Prediction. The denominator 
terms for the two steps are in both cases 
the diagonal elements of the »g matrix al- 
though as previously noted the og matrix 
will not in general be the same for both 
procedures, The numerator terms, how- 
ever, are different since they come from 
step 4 which is different for the two pro- 
cedures. 

The predictor variable corresponding to 
the largest ,A, is the second selected pre- 
dictor. This we designate as predictor b. 

6. Divide each element of the bth row 
of og by its bth element and designate the 
ratio by 2G. In scalar notation this is 


[rx] 


28 bk 


bb 
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or in matrix notation 
=—- 
2h bb 


This step is precisely the same as for step 
6 in Differential Prediction except that, 
of course, the bth predictor will not in 
general be the same for both cases. 

7. Compute a new matrix ag from the og 
matrix of step 2, its bth column and .G, 
of step 6 by the scalar formula 


[x2] 


= hor Gop. [x3] 


This step is the same as in Differential 
Prediction and means that the element in 
the kth row and pth column of 3g is ob- 
tained by subtracting from the corre- 
sponding element of og the product of the 
element in the bth row and kth column of 
ag by the pth element of .G,. As in Differ- 
ential Prediction, both the ath and bth 
rows and columns of yg will have all zero 
elements. In matrix notation Equation 13 
is 


af = 28 280 Go’. [14] 


8. Compute a new matrix ;7 from the 
oT matrix, its bth row and the vector from 
step 6 by means of the scalar equation 


aT ep =2T 27 [15] 


This step is also the same as for Differen- 
tial Prediction and means that the ele- 
ment in the kth row and pth column of ;7 
is obtained by subtracting from the cor- 
responding element in 27 the product of 
the element in the bth row and kth column 
of sT by the pth element of the 2G, vector. 

As in Differential Prediction both the 
ath and bth rows of ;T will have al! zero 
elements. 

In matrix notation Equation 15 may be 
written 


aT = [16] 


g. Compute the sums of squares for 


each row of sT obtained in step 8 by means 
of the scalar equation 


[17] 
which in matrix notation is 


x.’ aT... 


[18] 


10. Divide each ,V, computed in step 9 
by the corresponding diagonal element of 
ag obtained in step 7 and designate the 
kth ratio as 34,. In scalar notation this is 


[19] 
a8 


or in matrix notation 
Dy’, 


[20] 


where the notation is as defined for Equa- 
tion 10. The variable corresponding to the 
highest ,A, is the third or kth predictor to 
be selected. 

11. Repeat steps 6 through 10 to get 
the fourth and subsequent predictors of 
the subset. If we let w be the ith selected 
predictor, then the general equations 
representing the five steps required for 
selecting all predictors after the first two 
are given in scalar notation by the follow- 
ing five equations respectively. 


(1) Gwk= ’ [or] 

(2) (i+ = ikkp— Lop, [22] 

(3) (+1) iT iT or Gop, [23] 

(4) 41) Vie =Z Te’, [24] 

(5) +1) 4e= [25] 
(i+ 1) 


The first three of these steps, as previously 
indicated, are the same as for Differential 
Prediction. 

In matrix notation the corresponding 
steps are 


(1) 


126] 
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[27] 
[28] 
[29] 


[30] 


(2) (i+ 18 = Ge’. 

(3) Gu iT 


As in the case of Differential! Prediction 
each new ,T matrix has one more vanish- 
ing row than the preceding one and each 
new ,g matrix has one more vanishing row 
and column than the preceding one. 

Steps 6 through 10 may be repeated 
until the desired number of predictors has 
been selected. This number may be deter- 
mined on the basis of administrative con- 
siderations such as the total amount of 
testing time available or limitations on the 
facilities available for processing the data. 
The selection procedure may also be 
terminated if it is believed that the A 
values are too small to justify adding addi- 
tional variables. It should not be too diffi- 
cult to develop tests to determine whether 
a given A differs significantly from zero. 
However, it is beyond the scope of this 
report to do so. It is proved in Part IV 
that the sum of all the A’s is given by 


A= + [31] 


where (Ci is the variance of the predicted 
criterion measure & or simply the square 
of the multiple correlation of criterion k 
with the i selected predictors. ‘The sum of 
these variances is what we have defined 
as the inex of multiple absolute prediction 
efficiency. 


B. Solving for the Regression Vectors 


For any specified set of predictors and 
criterion variables it has been proved in 
Differential Prediction that the least- 
square regression vectors will yield the 
highest index of differential prediction 
efficiency. In this report it is proved that 
the least-square regression vectors will 
also yield the highest index of absolute 


prediction efficiency. Therefore, for any 
specified set of predictors and criteria the 
same set of regression vectors will be used 
for both types of prediction. Assuming, 
however, a specified set of potential pre- 
dictors and a set of criterion measures, 
the procedures for selecting a subset of 
predictors of specified size will not in gen- 
eral yield the same subset for both types 
of prediction. However, once a subset of 
predictors has been selected for either type 
of prediction, the procedures for calculat- 
ing the regression vectors and multiple 
correlations are the same for both cases. 
Instead of repeating the procedures in de- 
tail we shall therefore give only a brief 
summary. The steps for computing the 
regression vectors are as follows: 

1. Make up a matrix L from certain of 
the elements in the G vectors indicated by 
Equations 21 and 26. All elements in and 
below the diagonal of the 1 matrix are 
zero. The elements in the first row are 
taken, with opposite sign, from the ath 
row of the ,g matrix and are, in order of 
selection, the elements corresponding to 
all but the first selected variable. The sec- 
ond row consists of those elements, with 
signs changed, from the »G vector (see 
Equations 11 and 12) corresponding, in 
the order of selection, to all but the first 
two selected predictors. Similarly for sub- 
sequent rows the elements are taken in 
order of selection, with signs changed, 
from the corresponding ,G vector. Assum- 
ing four selected predictors the L matrix 
may be indicated by 


| Law Lee | 

° Lue 

L= | 
° ° [32] 
° ° 


2. Next we make up a matrix the same 
order as L in Equation 32, which we call 
F’, with all zero elements above the di- 
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agonal, thus 


Pus ° ° 
Fa Foe ° 

= 
Poa Foa Fea Fae 


The diagonal elements of F’ are the 
reciprocals of the diagonal elements taken 
from the matrices. These are, 


1 
= 1 
ikaa 


Fy=-- | 
28 bb [34] 

1 
hee 


etc. 


The other elements of F’ are given by the 
following sets of equations: 


Lok cc 

[36] 
Fac= bet Lak cc 

F a= Leak aa 

cat Lod aa [37] 


La vat Lad cat aa 


In matrix notation, the above solution 
for F’, known as the “back solution,” may 
be expressed as the product of two super- 


matrices: 
@| (=) 
. 


The method of solving for the elements of 
F’ is given, in brief, in Part II of Differ- 
ential Prediction. 

3. To check the elements of F’ in 
Equation 37 we first consider a matrix 
£«« of intercorrelations of the selected pre- 
dictors arranged in the order of selection. 
We indicate the sums of these columns by 


Lgui, Ugvi, Vgei, and Yeas. Then the checks 
for the rows of F’ are given by 


Paw gaitF 

=1-—- 


[38] 


Actually it is not necessary to make up a 
table of the intercorrelations of selected 
predictors. The required sums may be 
readily computed directly from the ig 
matrix. If the final equation in Equation 
38 checks, this is also a check on the cor- 
relation sums. 

In matrix notation the checks indicated 
by Equation 38 are given by 


F'( =1—L1. [39] 


4. Next we make up a matrix ¢ such 
that the first row is the row of ,;7 cor- 
responding to the first selected predictor. 
The second row is the row of 27 corre- 
sponding to the second selected predictor 
and so on. We have therefore, 


| lai lag lan | 
| toy bon 
t= | |40| 
bales’: tew 
lan) 


5. To get the regression vector 84 for 
the kth criterion we use the F matrix and 
the kth column of ¢ in Equation go. In 
scalar notation we have 


Bar=F aatar +F HP acter 


[4x] 


etc. ) 


If we let 8 be the matrix of regression 
vectors in which the rows represent se- 
lected predictors and the columns criteria 
we can write Equation 41 in matrix nota- 
tion as 


[42] 
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C. Solving for the Multiple Correlations 


The solution for the multiple correla- 
tions proceeds exactly as in the case of 
Differential Prediction except, of course, 
that the selected predictors will not in 
general be the same. The check on the 
computations has, however, been simpli- 
fied. 

1. First we make up a matrix from the 
rows of the ,7J matrix corresponding to 
the selected predictors. The rows are 
written in the order in which the variables 
were selected and the matrix is designated 
1T (iy. 

2. We then compute the squares of the 
multiple correlations by means of the con- 
ventional scalar formula, 


[43 | 


where the r;, are the correlations of the 
selected predictors with a given criterion 
and the 6; are the corresponding beta 
weights. If we now consider only the i 
selected predictors and let 
iT, be the kth column of 
B, be the column vector of beta 
weights for the criterion k, and 
R, be the multiple correlation of cri- 
terion k with the predictors, 
we have in matrix notation 


R2=1T x. 144] 


3. The R,? are checked by summing 
them and seeing that the total is the same 
as the sum of the ;A given by Equation 31. 
In scalar notation the check is given by 


las] 


4. If desired, the square roots of the 
R,? computed in step 2 may be calculated 
to give simply the multiple correlations. 


III. A NUMERICAL ILLUSTRATION 
A. The Data 


In this part we shall illustrate the meth- 
ods outlined in Part II by a numerical 


example. The same data will be used as 
in the Differential Prediction report so 
that the results of the two methods may 
be compared. 

The sample consists of 2,243 entering 
freshmen at the University of Washington 
for the fall of 1947. There are eight pre- 
dictor variables as follows: the mean of 
high school grades in (1) English, (2) 
mathematics, (3) foreign language, (4) so- 
cial science, (5) natural science, (6) elec- 
tives, and the Q and L scores on the 
American Council on Education Exam- 
ination. 

The criterion measures are the grade 
point averages for ten different college 
course areas. These subjects, together 
with the number of cases for each course 
are: anthropology (603), chemistry (752), 
economics (1125), English (2005), foreign 
language (619), geology (475), history 
(583), mathematics (848), psychology 
(1255), and zoology (469). 


B. Computations for Selecting the 
Predictors 


The illustrations of the computational 
procedures will follow the same number- 
ing as the steps in Part II, Section A. 
Table 1 gives the same matrix of inter- 
correlations of predictor variables as was 
used in the Differential Prediction study. 
‘As in that study the predictor selection 
process is carried out until all eight pre- 
dictors have been selected, although the 
multiple regression vectors and the mul- 
tiple correlations are based on the first 
four selected predictors just as in the case 
of the former study. Table 1 is the one 
which we have designated as yg in Part IT. 
For checking purposes the matrix has been 
summed by rows and columns and a grand 
total is entered in the lower right-hand 
corner. 

Table 2 is the same matrix of validity 
coefficients used in the Differential Pre- 
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TABLE 1 


Tue MatRix oF Prepictor INTERCORRELATIONS 


5 6 7 8 


1 2 3 4 z 
| 1.0000 - 5647 -6437 7193 5918 -4719 -3470 4.4528 
2 | 1.0000 §893 5366 5801 3737 3490 +2545 | 4.2485 
3 | +6437 -§893 1.0000 .5679 4003 «1458. 2983 4.1943 
4 -7193 5306 -§079 1.0000 - 5936 -4051 1474 +3230 | 4.2929 
5 | .§918 5801 §490 -5936 1.0000 -4372 . 2000 3185 4.2702 
6 -4719 -3737 4003 -4051 -4372 1.0000 1405 1168 | 3+3455 
7 1144 3496 -1458 -1474 . 2000 1405 1.0000 4160 | 2.5137 
% -3470 2545 2983 3230 1168 .4160 ~=1.0000 3-0741 
z 4-4528 4.2485 4.1943 4.4929 4.2702 3.3455 2.5137 3.0741 | 30.3920 
Check 


diction report. It is the transposed form of 
what we have called ,T in Part II. 

1. The first row at the bottom of Table 
2 consists of column summations. The a 
row consists of sums of squares of column 
elements. These are given by Equation 1 
of Part II. The largest value in the a row 
is 1.6671 for variable 5. This variable, 
therefore, becomes the first selected pre- 


30.3920 


sulting values are entered in the ‘“‘check”’ 
row at the bottom of 2g. The summation 
row of og is obtained by adding across a 
row through the diagonal and then down 
the column, exclusive of the check item. 

We may illustrate the use of equations 3 
and 4 of Part II for getting the first col- 
umn of 29 as follows: 


dictor, namely, variable a. 1.0000 — XX 
2. Next we compute the sg matrix 2 .5647 — .s801 X .so18 = .2214 
3 .6437 — .§490 X .5918 = .3188 
shown in Table 3. This is computed by = 
means of Equation 3or4of Part II. Since 5 .5918 — 1.0000 X .5918 = 0000 
6 4719 — .4372 X .§918 = .2132 
compute only the elements in and below 8 .3470 — .3185 & .s918 =  .1585 
the diagonal ; the summation row of ig is Z 4.4598 — 4.2703 X 9918 = 1.9957 Check 
included in the computations and the re- 1.9257 = 
TABLE 2 
Tue Marrix or Corrricients 
Predictors 
Crit. — ——| Total 
I 2 3 4 5 6 7 5 
3357 4188 = .4703 2257). 3.764 
2 | .3488 3737 3876 .4906 2855 .2958 | 
| ,4028 .4200 3045 . 449° -4329 . 2166 .2727 | 
4 4865 .3674 .2055 .4427 | 
5 | «3968 3852 4400 3793 3855 2794 1288 2419 
6 .3383 -3349 .3701 3836 .1972 3185 
7 3440 2335 3208 3851 . 2870 0952 | 
8 | +3139 3796 . 2928 .4156 . 2673 . 2998 226. 
9 | +3555 3029 3228 3969 -3799 .1992 197% 
10 449° 4lig .4056 -2375 2633 4210 
z 3.8315 3-7259 3-6994 3.9930 4.0601 2.3300 2.1809 4.3168 27.1370 
a 1.493! 1.4361 1.3844 1.6181 1.6771 0.5609 «00. 5176 1.1470 
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TABLE 3 
THE og MATRIX 
1 2 3 4 5 6 7 8 
I -6498 
2 | 
3 3188 . 2708 .6986 
4 3680 -1923 .2420 
5 ° ° ° ° ° 
6 .2132 .1201 1603 1450 ° Bo8g 
7 — ,0040 2336 .0360 0287 ° .0531 .g600 
8 .1585 .0097 1234 1339 ° — .0224 8980 
Check 1.9257 1.7714 1 8500 1 7581 ° 1 4736 1.60597 1.7140 
Zz | 1.9257 1.7714 1.8499 1.7581 o 1.4788 1.6597 1.7140 
To get the second column of og we use Ty 
I -4378 — .3950 X .5918 = .2040 
the second column of ig and the second — 4966 .so18 = .os40 
element of the fifth column of ig. The re- 3 4028 ~ X «5918 = «1400 
4 — .4184 X .5918 = .2389 
maining columns are obtained by using 3968 — X .so18 = . 1087 
6 .3383 — .3836 X .so1r8 .1113 
the appropriate columns of ,g and the cor 
responding elements of the fifth column 3139 — X .so18 = 
of 1g. 9 +3555 -3799 X .5918 = .1707 
10 — .46560 X .5918 = .1316 
3. ThesT matrix is computed by means — 
of Equation 5 or 6 of Part II fromthe 43-8315 ~ 4.0001 = 1.4288 Check 


matrix, its ath or fifth row and the fifth 
column (or row) of the ;g matrix. Table 4 
is the 27’ matrix in transposed form. The 
use of Equation 5 or 6 of Part II may be 
illustrated for getting the first column of 
2T’ as follows: 


1.4287 


The remaining columns of 7” are ob- 
tained in the same way by using the ap- 
propriate columns of the ,7’ matrix and 
the appropriate element from the fifth row 
of ig. 


TABLE 4 


I 2 3 
I . 2040 . 1066 2019 
2 -0549 1835 
3 | .3466 . 1689 
4 2389 -1247 1748 
5 1087 -1616 
-1742 .0670 1632 
8 | 0679 1846 
9 .0825 
10 | 1316 -0975 

Check | 1.4287 1.3706 1.4704 I 
Zz 1.4288 I 


Tue 27’ Matrix 


4 5 6 7 8 
. 2448 ° .0326 1467 2506 
.0928 ° 1965 1309 
.1920 ° .0273 2342 
1926 ° 3004 
1505 ° 1109 .O517 -1191 
1424 ° 02905 1196 1963 
2147 ° .0317 .0378 
0460 ° .2107 
1714 ° .2177 
1355 ° +0339 -1702 2727 
5829 ° 0.5549 1.3089 2.0237 
5827 ° °.5549 1.3689 2.0235 


: 
a | .2334 2052 2342 2814 0406 2188 4532 
b |  .3592 3093 3352 4345 ° .0§02 5043 
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TABLE 5 
Mareix or Vectors with OF THE DIAGONAL ELEMENTS FROM 


1/ show 


| 
| 


I .00000 
1.11284 
1.59337 
1.65180 
1.85770 
1.35870 
2.70197 
1.39606 


DM 


4. The rows at the bottom of the ,7’ 
matrix are not quite the same as for the 
1:7’ matrix. The first row at the bottom is 
labeled “‘check”’ and is obtained by apply- 
ing Equation 5 of Part II to the 2 row of 
iT’, The second or 2 row consists simply 
of column sums, exclusive of the check 
entry and should, within rounding errors, 
be the same as the check row immediately 
above it. Row a is obtained in precisely 
the same way as row a of ,7”. It consists 
of the sums of squares of column elements 
given by .V, of Equation 7 or 8 of Part II. 

5. Row 6 is obtained by dividing each 
element in row a by the corresponding di- 
agonal element of the og matrix. These are 
the »A, values given by Equation 9 or 10 
of Part IL. For example, the first element 
in row b is 


It may be noted that in the case of ,7’ we 
would also have a 6 row obtained by di- 
viding each element in its @ row by the 
corresponding diagonal element of the \g 
matrix. However, the diagonal elements of 
the ,g matrix are all unity, Therefore, the 
b row for the ;T’ matrix would be identi- 
cal to its @ row, 

The highest value in row b of Table 4 is 
.§043 for variable 8. Therefore, the second 
selected predictor 6 is variable 8. 

Using Equation 31 of Part II we have as 


THE CORRESPONDING ig VECTORS 


5 6 Zz 
-4372 
-0249 
+2373 
. 1298 
-1§75 
. 2824 

1.0000 


2702 
-9°75 
3943 
2796 
+9799 
. 2824 
.0000 


our index of absolute prediction efficiency 
for the first two selected variables 


he = Ast 2ds= 1.6771 +.5043 = 2.1814. 


6. Each element of the dth or eighth 
row of og in Table 3 is now divided by the 
eighth diagonal element, as described be- 
fore, to get the 2G values given by Equa- 
tion t1 or 12 of Part II and the results are 
entered in the second row of Table 5. The 
first row is simply the ath or second row 
of the ,g matrix of Table 1. Since the re- 
ciprocal of the eighth diagonal element of 
the 2g matrix is required in subsequent 
computations, it is computed first and 
entered to the left of the second row of the 
G matrix. Then each element of the 
eighth row of og is multiplied by it to get 
the other entries in the row. The check 
sum and the eighth element are both in- 
cluded. The eighth element in the second 
row of Table 5 should be unity. This 
checks the computation of the reciprocal. 
The total of the entries exclusive of the 
reciprocal and the check is entered at the 
extreme right of the row. This should be 
the same as the check entry immediately 
to its left. 

7. The sg matrix is now computed from 
the og matrix of step 2, its dth or eighth 
column and the 2G,’ vector of step 6. The 
procedure is exactly the same as for step 2 
except that the constant multiplier for a 
given column is the corresponding element 
of the Gs’ vector in the second row of 


10 
| 1 2 3 4 
-§918 .5801 .5490 .5936 
-1764 .0776 .1373 «1490 
-5488 .2398 .3563 1.0000 
-1805 1.0000 .3244 ° 
-2578 ° 1.0000 ° 
— ,0988 ° ° ° 
1.0000 ° ° ° 
° ° ° ° 
.2334 
24, = = ——— = ,3592. 
ir 
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TABLE 6 
THE 3g MATRIX 


4 


-6817 

+2236 
° 

.1634 

— .O124 
° 


Out 


2063 
° 


— ,0238 


° 


1.6234 


1.6384 1.6146 


1.6234 


1.6384 1.6145 


Table 5. The sg matrix is given in Table 6. 

8. Next the 37’ matrix is computed 
from the »7’ matrix of step 3, its bth or 
eighth column and the G5’ vector of step 
6. The procedure is the same as for step 3 
except that the constant multiplier for a 
given column is taken from the corre- 
sponding element of the »Gs’ vector in the 
second row of Table 5. Table 7 gives the 
37’ matrix. 

9. Row aof the 37” matrix in Table 7 is 
simply the sums of squares of column ele- 
ments as in the case of ,7’ and 27” in 
steps 1 and 4. 

10. The 6 row of 57” is obtained in the 
same way as for 27’ except that now each 


1.5027 


1, 5026 


element of row a is divided by the corre- 
sponding diagonal element of the sg matrix 
of Table 6. The largest value in row 6 of 
Table 7 is .3005 for variable 4. Therefore, 
the third or cth predictor is variable 4. 
By means of Equation 31 of Part II we 
have as our index of absolute prediction 
efficiency for the first three selected pre- 
dictors 


= 2.1814 +.3005 = 2.4819. 


11. Steps 6 through 1o are repeated to 
select the fourth predictor. 

a. The 3G,’ vector in the third row of 
Table 5 is obtained from the fourth row 
of the sg matrix in Table 6 and its fourth 


TABLE 7 
Tue 37’ Matrix 


.O717 
1096 
0344 
-0300 
.0879 
.O407 


ooooo000000 


0401 
.0570 


11 
I é 3 | 5 6 7 8 
| -6218 
|. 2091 6581 
.2970 2612 
| +3444 . 1819 
° ° ° 
| .2172 -1218 ° 8083 
— .0661 ° .0619 8219 
° ° ° ° ° 
Check | ° 1.5215 -9876 ° 
> | ° 1.5215 -9878 ° 
I 2 3 4 
.1598 .0872 2075 
-0318 0831 
1053 .1§07 .0946 .1§71 
. 1843 . 1007 -1323 .1465 
-1477 -1524 .2120 .1328 
.0767 .0972 .1132 
-1392 -0516 -1359 1851 
-O513 -1773 .0320 
0923 0656 0843 1390 
1 -0835 -1§77 .o601 .0949 
Check | 1.0719 1.2137 1.1925 1.2812 ° -5755 ° 
Z | 1.0719 1.2137 1.1923 1.2814 ° -5755 ° 
a .1368 1664 1886 ° .0734 
b | +2200 .2528 2373 3005 ° ° 
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TABLE 8 
THE «g Matrix 


-773° 
-0675 


.8210 
° 


diagonal element. 

b. The 4g matrix of Table 8 is obtained 
from the sg matrix of Table 6, its fourth 
column and the third row of Table 5. 

c. The,4T7’ matrix of Table g is obtained 
from the 57’ matrix of Table 7, its fourth 
column and the third row of Table 5s. 

d. The a row of Table 9 is simply the 
sums of squares of column elements of the 
PY iad matrix. 

e. The ,d, valués in row b of Table 9 are 
obtained from the a row of Table 9 and 
the diagonal elements of Table 8. 

The procedure has been repeated until 
all eight predictor variables have been 
selected. The remaining rows of Table 5 


1.1649 1.0447 


° 1.1649 1.0448 


give the successive G’ vectors in the order 
in which the variables were selected. 
Tables 10 through 17 give alternately the 
remaining ,g and ,7’ matrices. 

Vor convenient reference the successive 
indices of absolute prediction efficiency 
are summarized below. 


M = Ap = 1.6771 
= .6771 
= .1814 
.4819 
-6492 
.7218 
-7722 
-7949 


2 
3005 2 
- 1673 2 
-0726 2. 
-0504 2 
-0227 2 
-O119 2 


C. Computations for the Regression 
Weights 
In solving for the regression vectors we 


TABLE 9 


Tue Marerix 


Oc 


12 
I 2 3 4 5 6 7 8 
1 -4328 
2 . 1093 6054 
3 1964 .6020 
4 ° ° ° ° 
5 ° ° ° ° ° 
6 | +4355 -0786 1103 ° ° 
7 .0530 .2132 — .0039 ° ° 
8 | ° ° ° ° ° || ° 
Check | .7988 1.2029 1.0791 ° ° 
+ 1814 
4819 
+ 6492 
+ 7218 
+ 7722 
+ 7949 
+ 8068 
1 2 3 4 5 6 7 8 
-0270 .0936 
- .0570 
| 1052 
| .0582 .o8o1 
.0748 . 1139 . 1647 
.0146 .0044 -0437 
.0376 ,0020 .0699 
-0337 . 1680 .1271 
-0253 .0348 
-O314 .1302 .0263 
Check | 3687 8424 -7387 ° ° 6241 ° 
Zz | 3686 - 7358 ° ° .6242 ° 
a .0229 .0719 ° ° .0224 .O775 ° 
b -0529 +1194 ° ° .0290 -0943 ° 
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TABLE 10 
THE og MatRrix 


Tue Matrix 


| 


OO OMS 


TABLE 13 
Tue 67” Matrix 


6 


wn 


eooooooco°o 


° 
° 


° 

° 


13 
I 2 3 4 5 6 7 8 
I 
2 ° ° 
3 . 1388 ° 
4 ° ° ° ° 
5 ° ° ° ° ° 
6 .1213 ° .0848 ° ° . 7628 
7 | ° — .0731 ° ° .0398 -7459 
8 ° ° ° ° ° ° ° ° 
Check | 5818 ° 6889 ° ° 1.0088 6211 ° 
z .§817 ° 6888 ° 1.0087 6211 ° 
TABLE 11 
Check ° ° ° 1918 ° 
z .2165 ° .4026 ° ° -1917 -3274 ° 
a .0163 ° .0391 ° ° 0143 .0342 
6 -0395 ° .0726 ° ° .0187 .0458 ° 
7 8 0334 .0954 0 
—_| —,0186 .0638 o 
0061 oO 
-0475 —.0127.0 
.0273 0 
—.0181 —.0227 
.1317 
.0020 0 
-0038 o 
-1188 §=.3902 
-1189 39902 
b | 0 0 0 .0139 0 
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TABLE 14 
THe 1g Matrix 


I -3701 
2 ° ° 
3 ° 
@ 
5 ° © 86 
6 | 9 0 7458 
» ine ° ° 
Check | 4746 © © © .8503 Oo 
z Gut 


shall assume as in the case of Differential! 
Prediction that only the first four selected 
predictors are to be used. The successive 
steps to be followed are numbered to cor- 
respond to the steps in Part II, Section B. 

1. We first prepare the L matrix of 
Equation 32, Part LI from the appropriate 
entries in Table 5. This gives us Table 18. 
The first row of Table 18 is obtained from 
the first row of Table 5. It includes, in the 
order of selection, only those elements cor- 
responding to the selected variables. Note 
that the first entry, the one corresponding 
to the unit element, is zero. The other ele- 
ments are all copied with opposite sign 
from the first row of Table 5. 


TABLE 15 
Tue 77’ Matrix 


I 6 7 8 

1 0250 0 09 —.0314 © 
2 0 0 © O 0268 
3 | © —.0230 0 
4 0770 © © © 
5 .0200 0 0 0 O 0484 0 0 
6 —,0002 © 0 0 —.0004 0 
7 -0176 0 0 © —.0165 o 
—,.0023 © © .0379 Oo 
9 0088! © © © —,.0045 
10 0139 © © © .0025 
Check 1358 0 0 © oO 0917 0 Oo 
z 1358 © © Oo 0918 
a 0000 0097 
b | 0227 0 © 0 O 0130 © 


TABLE 16 
THE sg MATRIX 


| 


I ° 
2 
3 
4 o es © 
7 @ @ ° 
Chek ic 0 @ © 
Zz © © @. 
TABLE 17 
THe Matrix 
6 
I 0 0 8 —.038s5 ° 
2 8.46 ° 
3 ° 0 80 —.0245 ° 
4 -03603 ° 
6 0 0 80 @ —.0063 ° 
8 ace’ .0385 ° 
9 0 —,.0068 ° 
10 | 0 © 0 © ° 
Check | 0 © © 0534 «0 «OO 


° 
° 
° 
° 
° 


The first two entries in the second row 
are zero. The remaining elements are 
copied, with sign reversed, from the ele- 
ments in the second row of Table 5 cor- 
responding in order of selection to all but 
the first two selected predictors. The re- 
maining rows are similarly obtained from 
the corresponding rows of Table s. 


TABLE 18 
Tue L Matrix 


5 8 4 a +4 z 
5 —.3185 —.5936 —.5801 |—1.4922 
8 ° ° —.1490 —.0776 |— .2266 
4 ° ° ° — |— .2898 
2 ° ° ° ° ° 


° 
| 
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TABLE 19 
Tue F’ Matrix CHecks 
Checks* 
5 8 4 2 z — —— 
1—Ll F'(g@) 
5 1.0000 ° 1.0000 2.4922 2.4922 
8 — .3544 1.1128 ° ° 7584 1.2266 1.2266 
4 — .8702 — .2374 1.5934 ° 4858 1. 2898 1.2901 
2 | —.6559 — .0569 — .4787 1.6518 4003 1.0000 1.0001 
— .8805 8185 1.1147 1.6518 2.7045 
* See text 
TABLE 20 — .2374=(—.1490) X 1.5934, 
—.8702 = (—.3185) X(—.2374) —.5936 
| 5 8 4 1.5934- 
5 | 1.0000 .3185 .5936 .5801 | 2.4922 ‘Similarly by means of Equations 37 of 
8 | .3185 1.0000 .3230  .2545 | 1.8900 = Part II the third, second, and first ele- 
4 | -5036 .3230 1.0000 .5366 | 2.4532 
2 | : 2.3712 ments of the fourth row of Table 19 are 


. 5801 


The rows of the ZL matrix are summed 
and the sums are entered in the corre- 
sponding rows of the 2 column at the right 
of the LZ matrix. 

2. Next we prepare the F’ matrix of 
Equation 33, Part II as shown in Table 
19. The diagonal elements of this matrix 
are copied from the left-hand column of 
Table 5. These are the elements indicated 
by Equations 34 of Part II. To get the 
first element in the second row of Table 
19 we use Equation 35 of Part II. This 
gives 


—.3544= —.3185 1.1128. 


The second and first elements in the 
third row of Table 19 are, respectively, by 
Equations 36 of Part II 


computed, 

The columns of F’ are summed for sub- 
sequent checking purposes. 

The elements in the second column from 
the right in Table 19 are obtained by sub- 
tracting from unity the value obtained for 
the corresponding row sum in Table 18. 

3. To check the F’ matrix we first 
compute the sums of columns (or rows) 
of the matrix of intercorrelations of the 
selected predictors. These sums may be 
calculated directly from the ig matrix of 
Table 1 without actually copying out the 
matrix for the selected predictors. To 
clarify the procedure, however, the ma- 
trix of intercorrelations for the selected 
predictors is given in Table 20 together 
with the row sums. 

It will be recalled that the entries in the 
second column from the right in Table 19 


TABLE a1 
Tue Matrix 


I 2 3 4 5 


5 .4906 .4329 .4184 3855 
8 .2506 .1309 .2342 .3094 
4 .2075 .0733 «1571 .1465 .1328 
2 


| .0270 .1§21 .1052 


.1139 


6 7 i] 9 10 Zz 
. 3836 2870 «6.4156 .3799 4056 4.06001 
1963 1985 .0941 .2177  .2727 2.0235 
1132 .0320 .1390 .0949 1.2814 
0644 —.0020 .1680 0.8423 


.0253 .1302 
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TABLE 22 


I 2 3 4 

is 
5 | .2867  .1442 1431 
4 | .3177  .0440 2050 
2 .2512 1738 

Check | .6983 .7015 .7353 -7510 

6983 


are unity minus the corresponding row 
sum in Table 18. These are the values 
given to the right of Equations 38 in Part 
II. The left sides of Equation 38 or 39 are 
obtained by summing the products of the 
row sums of Table 20 by the correspond- 
ing elements of a row of the F’ matrix. 
For example, the check for the last row of 
F’ is 
— .0559 (2.4922) — .0569 (1.8960) 
— .4787 (2.4532) +1.6518 (2.3712) 

= 1.0001. 
The entries to the extreme right in Table 
19 are these product summations and 
should check within rounding errors with 
the entries at their left. 

4. The rows of the ¢ matrix given by 
Equation 4o of Part II are taken from the 
appropriate columns of the ,7’ matrices. 
This matrix is given in Table 21. 

The first row is the ath or fifth column 
of the ,7’ matrix of Table 2. 

The second row is the bth or eighth 
column of the 27’ matrix, and so on. 

5. By means of Equation 41 or 42 of 
Part Il the matrix of regression vectors 8 
in Table 22 is obtained from Tables 19 
and a1. 

The first column of Table 22 is obtained 
from Table 19 and the first column of 
Table 21 as follows: 

The first element of the column is the 
sum of products of corresponding ele- 
ments in the first column of Table 19 and 
the first column of Table 21. The compu- 


-§927 


Tue 6 Marrix 


5 6 7 8 9 10 
. 1530 1733 .0569 2442 .2010 
1879 «17771 .0876 .2078 .2735 
-1571 1495 .2959 —.0294 .2094 .0889 
5928 6171 5265 -5798 -6242 7784 
6171 5266 


.5799 .6242 .7785 


tations for the first element are 
1.000 (.3950) —.3544 (.2506) —.8702 
(.2075) —.6559 (.0270) = .1079. 
The second element in the column is the 
sum of products of corresponding elements 
in the second column of Table 19 and the 
first column of Table 21. Each succeeding 
element in the column is the sum of prod- 
ucts of corresponding elements in the cor- 
responding column of Table 19 and the 
first column of Table 21. The element in 
the check position for the first column of 
Table 22 is the product of corresponding 
elements in the summation column of 
Table rg and the first column of Table 21. 
The element in the summation position 
in Table 22 is the sum of all elements in 
the column exclusive of the check entry. 
The check and summation entry should 
be the same within limits of rounding 
error. 

The elements in the second column of 
Table 22 are obtained in the same way as 
the first except now the second column of 
Table 21 is used instead of the first. The 
remaining columns are obtained by using 
the corresponding column of Table 21 
with Table 19. 


D. Computations for the Multiple 
Correlations 


The conventional method for solving 
for the squares of the multiple correlations 
is indicated by Equation 43 or 44 of 
Part I. 
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TABLE 23 
Tue 74) Matrix 


I 2 3 4 5 6 7 8 9 10 | z 
5 3950 .4966 .4329 .4184 .3855 .3836 .2870 .3799 .4056 4.0601 
8 -3764 .2891 .3721 .4427 .2419 .3185 .2899 6.2265 .3387 .4210 3.3168 
4 -4793 «3876 .4490 .4410 .3793 .3701 .3851 .2928 .3069 | 3.9930 
2 3357-4716 .4200 .3674 .3349 .2335 | 3.7259 
R .3125 .3060 .3214 .2139 .2173 .1808 .2308 .2289 .3419 2.6492 
R «5590 .5532 .5069 .4625 .4662 .4252 .4805 .4784 .5847 


1. For computational convenience it is 
well to copy in row form the columns 
from Table 2 corresponding to the selected 
predictors. These should be copied in the 
order selected as shown in Table 23 and 
are designated as the ,74) matrix. The 
sums should also be copied and the rows 
totaled to check the copying. 

2. The squares of the multiple correla- 
tions are then computed by getting the 
sums of products of corresponding ele- 
ments of corresponding columns of Tables 
22 and 23. These are entered in the row 
immediately below Table 23. The first 
entry in this row will be obtained from the 
first columns of Tables 22 and 23 as fol- 
lows: 

2957 = .3950 (.1079) +.3764 (.2281) 
+.4793 (.3177) +3357 (.0440). 

3. The check for the R’’s is given by 

A, =ZR,’, 


where & refers to the criterion variable. 
In this case d; is \y. We have therefore, 


a= 2.6492 = ZR? = 2.6492. 


4. The last row under Table 23 is 
merely the square roots of the elements 
in the row above, and gives the multiple 
correlations rather than their squares. 


IV. MATHEMATICAL DERIVATIONS 
A. The Index of Absolute Prediction 
Efficiency 
The index of absolute prediction effi- 
ciency employed in the methods outlined 
in the previous sections was based on the 


assumption that we wish to obtain from 
the predictors the best estimates in the 
least-square sense for all of the criterion 
measures. We assume standard measures 
for both predictor and criterion scores. 
We let: 
M = number of cases, 
n=number of predictors, 
N=number of criteria, 
X=the (MXn) matrix of predictor 
measures, 
Y=the (MXN) matrix of criterion 
measures, 
Z=the (MXN) matrix of best least- 
square estimates of VY obtained 
from X, 
B=the (nN) matrix of least-square 
regression vectors for estimating Y 
from X, 
ig=the (mn) matrix of intercorrela- 
tions of the complete set of poten- 
tial predictors from which the sub- 
set is to be selected, 
iT =the (nN) matrix of validity co- 
efficients, i.e., the correlations of 
the predictors with the criteria, 
r=the matrix of intercorrela- 
tions of the criteria, 
C=(NXN) matrix of covariances of 
the predicted criterion measures. 
Because of the above definitions, 


[1] 

[2] 
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(3) 
ZZ =C, l4] 

Xp=Z. 


We now consider the residual matrix 
[6] 


It is the trace of E’E which we wish to 
minimize. 

The solution for 8 in Equation 6 which 
minimizes the trace of E’E is well known 
to be 


B=(X'X) X’Y. {7 | 
Substituting 1 and 2 in 7 gives 
B=ig iT. [8] 
Substituting 8 in 6, gives 
E=(Y—Xg™ 7). lo] 


From 9 we have 


—,7" gt X’Y—Y'X, 


+,T7" iT), [10] 


or substituting from 1, 2, and 3 in 10 
gives 

M 

Obviously if we minimize the trace of 
E'E/M we also minimize the trace of 
E’E. Furthermore, the diagonal elements 
of r are all unity, hence the trace of r is V 
or a constant. Therefore, we may take as 
an index of multiple absolute prediction 
efficiency the trace of the right-hand term 
on the right of Equation 11, and designat- 
ing this by \ we have 


iT). [r2| 


But it can readily be shown from 1, 4, 5, 
and 8 that 


C=,T’ go iT. 113] 


According to 12 and 13 therefore, d is 
simply the sum of the diagonal elements 
of C or the sum of the variances of the pre- 
dicted criterion variables. But it is well 
known that the diagonal elements of the 
right side of Equation 13 are simply the 
squares of the multiple correlations of the 
criteria with the predictors. 


B. The Predictor Selection Formulas 

It should be clear then that for any 
given set of predictor and criterion vari- 
ables the matrix of vectors 8 which will 
yield the highest index of multiple abso- 
lute prediction efficiency is given by 
Equation 8 and is the one which will yield 
the most accurate estimates, in the least- 
square sense, of the separate criterion 
measures. As shown in Differential Pre- 
diction this is also the matrix which will 
yield the highest index of multiple differ- 
ential prediction efficiency. However, the 
problem is to select from a larger set of 
predictors that subset which will yield a 
as given in Equation 12as large as, or larger 
than, any other subset of equal size. As in 
the case of multiple differential prediction, 
it seems probable that for a given sample 
of data the only way that the subset of 
specified size having maximum \ could 
be found is actually to calculate the \’s 
for all possible sets of the specified size. 
However, suppose we select first the single 
predictor with the largest A; to this we 
add the predictor which yields the great- 
est increment to the \; and we continue 
in this way to select one predictor at a 
time until the subset of desired size has 
been selected. As in the selection pro- 
cedure for the differential prediction bat- 
tery we shall assume that the selection of 
one predictor at a time will yield a subset 
whose \ is sufficiently close for practical 
purposes to that of the subset of equal 
size with the largest \. 
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We begin, therefore, by deriving recur- 
sion formulas based on Equation 12 but 
involving a total number of predictor 
variables smaller than the total number 
available. Let us assume that we have al- 
ready selected i predictors and calculated 
the corresponding A, say Ay, and that we 
wish to find which of the remaining n—i 
predictors will yield the largest increment 
to 

We let: 

ga) =the (iXi) matrix of intercor- 
relations of i selected predic- 
tors arranged in the order of 
selection, 

T =the matrix of correla- 
tions of the i predictors with 
the \ criteria arranged in the 
order of selection, 

£ =the matrix of intercorrelations 
of the i selected predictors and 
one unselected predictor k, 

T iiy =the matrix of correlations of i 
selected predictors and one un- 
selected predictor k with the NV 
criteria, 

£coa=the column vector of correla- 
tions of the 7 selected predic- 
tors with unselected predictor 
k, 

T,.=the column vector of correla- 
tions of predictor k with the V 
criteria, 

Cy =the covariance matrix of pre- 
dicted criterion scores derived 
from the i selected predictors, 

C 41m = the covariance matrix of pre- 
dicted criterion scores derived 
from the i selected predictors 
and an unselected predictor k. 

According to the above definitions: 


= 
we 1 

T 
T (i414) = 


[14] 


Using Equation 13 we have: 

Cy = [16] 
Suppose now we define® 


= 


[18] 


Then we can prove that 


[19| 


—Bu 
( 1) 


Substituting Equations 15 and 1g in 17 
gives 
C = T 'T 
+ (Tr! — By’ T 


[20] 


(In the following development, the dis- 
tinction between the pre- and postscripts 
should be constantly borne in mind. The 
prescript refers to the number of pre- 
dictors selected [that is, the number of 
iterations] and has no reference to the 
order of the matrix. The parenthesized 
postscript, however, refers to the number 
of predictor variables represented in the 
matrix, and hence affects the order of the 
T and g matrices to which it is attached.) 

Suppose now we let: 


ily. = Tx. T 


[ar] 
and 
[22] 


Substituting Equations 16, 21, and 22 in 
20 gives 


Bee =1— 


iT 


iR kk 


2 Not to be confused with 8 defined in Equation 
8. 
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Now if we let and be the 
diagonal matrices of the diagonal elements 
of Cw and respectively, and 
and isi, the corresponding indices of 
absolute prediction efficiency, then be- 
cause of Equations 12 and 13 


Aw 1, 

and if we let 


we have because of Equations 23, 24, and 
25 
iT.’ 

Equation 27 corresponds to Equation 25 
of Part II. This equation indicates that 
the crucial requirement for the selection 
procedure is that recursion formulas be 
available for the computation of the (A, 
increments. 

In order to develop the recursion form- 
ulas, first let us consider the equation 
analogous to Equation 21 assuming that i 
predictors plus predictor k have been se- 
lected and that we wish to investigate the 
unselected predictor p. We would have, 


[28] 


Suppose now we let g,, be the correlation 
between predictors & and p, and we define 


= Brink Bid [29] 


and 


Grp 


Then as was demonstrated by means of 
Equations 62 through 71, Part IV of 
Differential Prediction, we can rewrite 
Equation 28 as 


If now we assume that predictor & was 


the (i+1)th variable selected, let k=w, 
and write Equation 31 in matrix form to 
include all values of p from 1 to m we 


have 
=iT— Ge [32] 


where ,7,,.’ is the wth row from ,J and 
G, is a column vector. Equation 32 cor- 
responds to Equation 28 of Part II. 

Our next step is to get recursion formu- 
las for the ige, values given by Equation 
29. 

Suppose we have selected w as the 
(i+1)th predictor. We wish to find all 
“+gep Values. From Equation 29 we 
write 


[33] 


Using the methods of Equations 74 
through 76 of Differential Prediction we 
rewrite Equation 33 as 


Bup 


\34] 


(i+ 1) = 
We let: 
«ang the matrix for all values 
of k and p on the left of Equa- 
tion 34, 
i= the (mm) matrix for all values 
of kand pin the first term on the 
right of Equation 34, 
= the wth column vector of 
Analogous to Equation 30 we define 
ik w 


iRww 


Using Equation 35 and the definitions 
just given, and assuming predictor w as 
the ith predictor selected, we rewrite 
Equation 34 in matrix form as 


= k.w Ge: 136] 


Equation 35 corresponds to Equation 26 
of Part II while Equation 36 corresponds 
to Equation 27 of Part IT. 

The selection process begins with the 
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1T matrix of validity coefficients. By 
means of Equation 27 for the case of i= 1 
we compute ,A, for all values of k. Since 
all diagonal values of ig are unity only 
the numerator term of Equation 27 is re- 
quired. The predictor with the highest ,A 
is the first one selected. 

By means of Equation 36 a 2g matrix is 
computed where w is the first or ath vari- 
able selected. Since all diagonal elements 
of 1g are unity, :G,.’ given by Equation 35 
is the same as ig. This operation is iden- 
tical to the calculation of the »9g matrix 
described in Differential Prediction. 

A»T matrix is now computed by means 
of Equation 32 where i=1 and w is the 
first variable selected. By means of Equa- 
tion 27 a second predictor is chosen. The 
routine continues until the desired num- 
ber of predictors has been selected. 


C. The Matrix of Regression Vectors 


The formulas for computing the matrix 
of regression vectors are exactly the same 
as in the case of Differential Prediction. 
We shall, however, indicate and prove a 
superior checking procedure for one of the 
formulas. First we let 7, be a vector of 
those elements from G,,of Equation 3 5cor- 
responding to the selected predictors and 
arranged in the order selected. Then yj, 
is defined as a matrix whose rows are the 
‘Yo’ vectors, where w goes from a to w. 
We also let 


faa O ° 
° 28 bb 


Then it can be shown, because of the 
method by which y,,, was computed, that 


= Va) 138] 
We now define a matrix F by 
F= (Dey). 139] 


This is the transpose of ‘he matiix indi- 
cated by Equation 33 of Part Ii. The 
method of computation is precisely the 
same as outlined in Differential Predic- 
tion. However, a superior method for 
checking the computations of F’ is given 
by Equations 38 and 39 of Part II. Equa- 
tion 39 of Part IT is 


=1—LA, 


where L is defined by the equation 
lar] 


We can readily prove that Equation 40 is 
an identity. From Equation 39 we have 


L=I—yu). 


\42] 


Substituting Equations 38, 41, and 42 in 
40 we have 


(De 1 


or 


l44] 


The solution for the regression vectors, 
once the F matrix has been computed and 
checked, is indicated by Equation 41 or 42 
of Part IT, and proved in Differential Pre- 
diction. 


D. The Multiple Correlation Formulas 


The method for computing the squares 
of the multiple correlations indicated in 
Equation 43, Part II, Section C, is the 
same as one of the methods indicated in 
Differential Prediction. This method is 
well known and therefore no proof of the 
method was given in Differential Predic- 
tion nor will it be given here. However, 
the check for the computations indicated 
in Part IT, Section C, may not be obvious. 
We need only recal! that the definition of 
d given by Equation 12 for any set of pre- 
dictors is the sum of the diagonal elements 
of the matrix in Equation 13. 
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From Equations 8 and 13 


C= ,T"B. 145] 


But the kth diagonal element of Equation 
45 is given by 


The right side of Equation 46 is the same 
as the right side of Equation 44 of Part II. 
We have, therefore, from Equations 12, 
13, 26, and 46 

l47| 


where the R,’ values are the squares of the 
multiple correlations of the i selected pre- 
dictors with the various criteria. 


V. SUMMARY 


This report presented (a) a short dis- 
cussion of the multiple absolute prediction 
problem as contrasted with that of differ- 
ential prediction, (b) a detailed descrip- 
tion of the computational methods de- 
veloped for selecting the absolute predic- 


tors and solving for the regression vectors 
and multiple correlations, It also includes 
(c) a numerical illustration of the compu- 
tational procedure, and (d) the mathe- 
matical derivation of the formulas de- 
veloped. 

The multiple absolute prediction prob- 
lem, as distinguished from that of differ- 
ential prediction was stated as the prob- 
lem of selecting from a battery of potential 
predictors that subset of specified size 
which will have the highest prediction 
efficiency for all the criterion variables, 
irrespective of how well it differentiates 
among them. The index of the prediction 
efficiency of the selected battery was de- 
fined as the sum of the variances of the 
predicted criteria irrespective of their co- 
variances. 

In both differential and absolute pre- 
diction, the end result is a set of predic- 


tions of success in each of the criterion 
activities. The weights to be used in both 
cases are found to be the conventional 
least-square regression weights, the essen- 
tial difference between the two cases being 
that the sets of selected predictors are not 
in general the same. 

Analogous to the technique presented 
in Differential Prediction (1), an iterative 
procedure was developed for selecting, 
progressively, the predictor which, when 
combined with the previously selected set, 
yields the highest index of absolute pre- 
diction efficiency. The process is con- 
tinued until the prespecified number of 
predictors has been so selected. It is as- 
sumed that this procedure yields an index 
of absolute prediction efficiency suffi- 
ciently close, for practical purposes, to 
that of the particular set of specified size 
which, for the particular data, would 
yield the highest index of absolute predic- 
tion efficiency. 

The numerical example presented in the 
computational section consists of opera- 
tions on the same matrices as those used 
in the computational section of Differen- 
tial Prediction, namely, a matrix of eight 
predictor variables and a matrix of ten 
criterion variables. The reader may there- 
by compare the results obtained for the 
two types of multiple prediction. 

The mathematical section comprises 
the development of the rationale for the 
selection procedure, and of formulas for 
obtaining the matrix of beta weights and 
the multiple correlations of the selected 
predictors with the criteria. This develop- 
ment eliminates the necessity for comput- 
ing the inverse of the predictor intercor- 
relations. 
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